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Abstract. Any rule for identifying a quantum system's state within a set of two 
non-orthogonal pure states by a single measurement is flawed. It has a non-zero 
probability of either yielding the wrong result or leaving the query undecided. This 
also holds if the measurement of an observable A is repeated on a finite sample of 
n state copies. We formulate a state identification rule for such a sample. This 
■ rule's probability of giving the wrong result turns out to be bounded from above by 

l/nS\ with 5a = — (A) 2 \ / (A\A + A2A) . A larger 8a results in a smaller upper 
bound. Yet, according to Fleming, 8a cannot exceed tan# with £ (0,7r/2) being the 
angle between the pure states under consideration. We demonstrate that there exist 
observables A which reach the bound tan 8 and wc determine all of them. 
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The transmission of a binary sequence through a sequence of quantum systems whose 
states are to be chosen from a given set {pi, P2} makes it necessary for the recipient 
to identify the states p\ and P2 with as little an error as possible. If each single bit 
is transmitted as a single system, an error minimizing strategy is needed in order to 
identify this system's state from one single measurement. 

If it is to be discriminated between two non-orthogonal states p\ and P2 through 
the measurement of an observable a certain positive lower bound for the probability of 
either wrong or inconclusive state identification cannot be underrun. Such limitations 
of individual state identification have been investigated extensively in theory and 
experiment. For a review see e.g. pp. 

If in contrast each bit is transmitted as a sample of n identical systems, all of them in 
the same state p e {pi, P2} > the minimum error in reading the message correctly reduces 
beyond the limit established for n = 1. More generally, the sequence of values, obtained 
by measuring an arbitrary, perhaps non-optimal observable A on each of the sample's 
members, can be used to lower the probability of an erroneous state identification below 
the one obtained for a single measurement of A. 

We will describe a rule of state identification from the mean value of a general 
observable A in an n-sample. For this rule we derive an upper bound for the probability 
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of error from Chebyshev's inequality associated with the mean value of A. It turns out 
that our rule of state identification produces the wrong result with a probability not 
greater than l/n5\. Here the dimensionless parameter 5 a > not only depends on the 
two states Pid>2 but also on the observable A to be measured on the sample's elements^ 
It is given bj|§| 



5- 



A P1 A + A P2 A' ^ 

This number therefore quantifies how well the states p\ and P2 can be distinguished 
from each other by means of measuring the observable A. 

We will address the issue of which observable A, for given states pi,p2, leads to the 
largest possible value of 5a- Such a choice then minimizes the upper bound l/n5\ of 
the probability of error for a given sample size n, yet it does not need to minimize the 
actual error itself. For arbitrary pure states p\ and P2 we find the maximum of 5a over 
the set of all linear, bounded and self-adjoint operators A. We prove that 

max^ = tan#, (2) 

where 9 with < 9 < n/2 denotes the angle between the states p\ and P2 Ijj] Furthermore, 
among all bounded observables A we explicitly specify those which maximize 5 a- 

The plan of the paper is as follows. In section 2 we summarize some results 
concerning optimal state discrimination for single systems and exhibit their relation 
to our minimization problem. In section 3 we derive the law of large numbers which 
motivates our quest for maximizing 5a- In section 4 we slightly adapt Fleming's 
derivation of the estimate 5a < tan 9 to our goals. This proof will then be used first in 
section 5 to demonstrate that the upper bound tan 9 can be attained and afterwards in 
section 6 to identfy those observables A which actually reach this bound. 



2. State identification for a single system 

How is the state p of a single quantum system to be identified within a given set {pi, P2} 
of two different yet non-orthogonal pure states pi and p 2 ? Is there an observable A which 
when measured upon p allows for identifying the state as either pi or p 2 most 'reliably' ? 

One way to render this question more precisely has been specified by Jaeger and 
Shimony. [2] It meanwhile bears the title 'minimum error state discrimination '. [3] 
Assume that the spectrum of the observable A consists of two eigenvalues ax,a 2 only. 
If the system, whose state is to be identified, is in the state pi with probability pi, then 
a measurement of a fixed observable A upon a randomly chosen state yields a random 

I Since we will vary A and keep the states fixed we refrain from using the more suggestive but 
cumbersome notation 5a {pi,P2) ■ 

§ The notation seems obvious and it is spelled out in sect. 2. 
|| This means that tr (P1P2) = cos 2 9 holds. 
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trial with the composite event space fl A = {pi, P2} x {^1,02} and with the probability 
measure W A whose distribution function p A obeys 

p A (p i ,a j )=p i -tr(p i p£). (3) 

Here P A denotes the orthogonal projection onto the eigenspace of A corresponding to 
the eigenvalue a e {a%, a 2 } . The positive numbers pi,p 2 have to obey p\ + p 2 = 1. In 
order to correlate the measurement outcome a, with the random state p^ as strongly as 
possible one has to search for an observable A which maximizes the probability 
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W A (D) = ^-tT{p t P A ) (4) 
i=i 

of the 'detection event' D = {(pi, 01) , (P2, a 2 )} . 

Since the states p\, p 2 are supposed to be pure, there exist unitvectors ipi £ fi such 
that pi = ipi (ipi, •) for i 6 {1,2}. Assume now tentatively that tr (^piP A ^j = for all 
pairs with i 7^ j. This implies P A ipj = for i 7^ j and therefore Aipi = a^i for 
all i. But this leads to (ipi,ip2) = which contradicts the assumed non-orthogonality 
tr (P1P2) 7^ 0. Therefore the detection event D cannot be certain whatever choice of A 
is made. Rather Jaeger and Shimony have proven in [2] that the maximum of W A (D) 
obeys 

max {W A (£>)} = ^1 + - 4 Pl p 2 • cos 2 tj (5) 

with 9 G (0, 7r/2) such that cos 2 9 = \(ipi,ip2)\ 2 = tr (pip 2 ) ■ Here the maximum is 
taken over all linear, bounded and self-adjoint operators A : % — > H whose spectrum 
consists of two fixed (unequal) eigenvalues ai, a 2 only. It comes with little surprise that 
max^ \ W A (-D)} does not depend on the choice of eigenvalues (a±, a 2 ) . 

A related maximization problem is the following one. Find a linear, bounded and 
self-adjoint operator A : 7-L — > 7-L such that firstly the spectrum of A consists of the 
eigenvalues a\ = a > 0, a 2 = —a and secondly A maximizes the weighted difference of 
expectation valus, i.e. A := p\ (A) 1 — p 2 (A) 2 with (A) i = tr {p%A) . Because of 

A = ap A (pi, a) + ap A (p 2 , -a) - ap A (pi, -a) - ap A (p 2 , a) (6a) 
= a ■ (2W A (D) — l) (6b) 

this maximization problem for constant a is equivalent to the previous one of maximizing 
W A (D). 

A genuinely alternative maximization problem is posed by the following one, which 
is known as 'unambiguous state discrimination'. [3] Assume now that the spectrum 
of the observable A consists of three (different) eigenvalues a ,ai,a 2 . Then the above 
probability space (p, A ,W A ) is replaced by the event space il A = {pi,p 2 } X {00,01,02} 
with the modified probability measure W A whose distribution function p A obeys 

p A {p h a j ) = Pi -tr ( Pi P A ) . (7) 
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If one now chooses A in such a way that 

p A (pi,a 2 ) = = p A (p 2} a l ) } (8) 

then, whenever the event {pi, p 2 } x {cti} occurs, it follows that p = pi. Thus, under 
these provisions, the state can be determined with certainty, whenever a measurement 
of A yields one of the values a\ or a 2 . Note that for D = {(pi, ai) , (p 2 , o 2 )} holds 

W A (D) = W A {{p u p 2 } x { ai , a 2 }) = 1-W A {{p u p 2 } x {a }) . (9) 

Yet, as above, the event D = {(pi, ai) , (p 2 , a 2 )} , allowing for a correct state- 
identification, cannot be certain. Therefore one is led to search for those observables A, 
for which in addition to the validity of equation (jSJ) the probability W A (D) is maximal. 
Jaeger and Shimony [2] have proven for dim (Ti) > 3 that 



l-2v^Ip2~-cos# for J "tote**} >cosd 

r „ T A, n \l J V ^ y max{pi,p 2 } — , , 

max {W 7 (£>)} = ^ (10) 

max {p!,p 2 } sin 2 for J ZZtnl^ < cos 



max{pi,p2} 

Here the maximization is performed over all those linear, bounded operators A, whose 
spectrum contains three different eigenvalues only, and which obey equation (JS}. 

In this work we shall consider a third maximization problem. Among all linear, 
bounded and self-adjoint operators A : H — > H we determine those which maximize 
the number 5a given by equation (JT|) for two arbitrary but fixed non-orthogonal, non- 
identical, pure state density operators pi, p 2 : H — > %. Here 

A p A = y/(A\ - (A) 2 p with (X) p = tr (pX) (11) 

denotes the uncertainty of A in the state p. The number 5a is invariant under the shift 
A A+pj- idu for any real p and also under the rescaling A — > \A for any non-zero real 
A. It relates the distance between the states' expectation values to their uncertainties 
and therefore has been proposed by Fleming [4] as a quantifier of the distinguishability 
of the states p\ and p 2 by means of measuring A on a finite sample. 
Part of our result is 

max {5a} = tan#, (12) 

A 

where A is allowed to run through the set of all linear, bounded, self-adjoint operators 
A : 7-L — > H. Observe that no further restriction on the spectrum of A is imposed. In 
particular, the spectrum of A may include a continuous part. 

Among the observables A maximizing 5a we shall identify one which also maximizes 
(A) — (A) . It is given bjjjjl A = (pi — p 2 ) / sin.6. This operator therefore also 
maximizes the probability of correct state identification W (D) for p\ = p 2 = 1/2 from 
equation (JSJ). Its value is given by 

max{W A (D)} = -(l + sin0). (13) 
1 The spectum of A is {1, -1, 0} if dim (W) > 3 and {1, -1} if dim (H) = 2. 
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In deriving equation (TT2|) we make use of a powerful estimate due to Fleming 
[I], that is conceived purely by general algebraic deliberations. Fleming called it a ' 
quantum master inequality', because he was able to derive a host of other well known 
quantum theoretical facts from it. Besides taking the orthogonality of two eigenvectors 
to different eigenvalues of an observable to a more general and quantitative level, 
Fleming's quantum master inequality also implies Robertson's generalized uncertainty 
relation 2AAAB >\([A,B])\.$ 

Fleming's quantum master inequality states that, whenever A Pl A + A P2 A > 0, then 
5a < tan 9. We shall first prove that Fleming's upper bound can be reached and then 
identify necessary and sufficient conditions on A for 5 a = tan 9 to hold. 

Before entering the problem of maximizing 5a we will clarify the role of 5a in 
identifying the state from an n-sample of states p G {pi, P2} • We shall do so in the more 
general context of identifying a probability measure W on the real line within a set of 
two options {W%, W2} ■ 



3. State identification for an n-sample 

Let W denote a probability measure on the real line with finite expectation value 
X := K w (id^) and variance A 2 := K w ((idj& — X) 2 ) . Chebyshev's inequality states 
that for any t G M>o holds 

W({x G R : \x-X\ > t}) < (^j . (14) 

The product space M n together with the product measure W n = W x . . . x W 
corresponds to the random experiment of drawing n real numbers independently and 
each one distributed by W. The mean value of such a sample (xi, . . . x n ) G W 1 is given 
by the function m n : IR n — > K with 

1 v— \ n 

m n (xi, ...x n ) = -) x^ (15) 

For the expectation value and the variance of the random variable m n under the measure 
W n holds 

A 2 

E^yn (m„) = X and Yw n (mn) = — • (16) 

n 

Application of Chebyshev's inequality to m n thus yields the following law of large 
numbers 

1 /A x 2 



W n ({uj G R n : \m n (u) - X\ > t}) < - ■ J (17) 

The probability that the mean value of a random sample u G M n of the distribution W n 
deviates from the expectation value by at least a fixed value t > converges to when 
n goes to 00. 

Now, let W\ and Wi denote two different probabilty measures on the real line of 
the above type. Their expectation values X$ are assumed to be unequal and without 



Reaching Fleming's discrimination bound 



6 



loss of generality we may assume X 2 > X\. We also suppose that at least one of the 
probability measures Wi has non-zero variance, i.e. that Ai + A 2 > 0. 

A sample u> G M. n of n real numbers is supposed to be generated by either the 
distribution or W 2 - From the sample's mean value one may try to guess whether 
the sample has been generated by or W 2 . To this end, observe first that as a 
consequence of Chebyshev's inequality (IT7|) we have for all ti,t 2 £ ^>o 

W? ({u eR n :m n (to) > X 1 + h}) < i • (j^j , (18a) 

({u eR n :m n (u) < X 2 - t 2 }) < - ■ ( ^] . (186) 

n \t 2 J 

Choosing now the numbers ti according to 

U = A, • 5 with 5 = f 2 ;^ 1 > (19) 
i\i + 1\ 2 

the estimates ( 11 8 aft and ( 1186)1 turn into 

({w eR n :m n (u) > X }) <--(^] , (20a) 



n \5 

H\ 



({w £l":m„ (w) < X }) < - • f ~ ) . (206) 



Here the point 



X = . A2 t- Xi + t Al A X 2 (21) 
Ax + A 2 Ax + A 2 1 ; 

divides the interval X 2 ] into a portion of length A ^. 1 A • (X 2 — X\) to the left of X 
and another one of length A ^ A • (X 2 — Xi) to the right of X . Observe that 

* + • <*» - = A ' 2 - A^S ' <** - A ''> = (22) 

Let a sample w e K n be generated with probability p\ > through the measure 
W-P or with probability p 2 = 1 — pi > through the measure W 7 ^. This corresponds to 
the composite random trial with event space Q = {1, 2} x IR n with the product measure 
W which obeys for all measurable Z <zM. n 

W{{i}xZ)= Pi -WT(Z). (23) 

Let E denote the event that the sample u is either generated by W 2 and yields a value 
fn-n (w) < X or is generated by and has a mean value m n (co) > X . Then this 
event's probability is bounded by 

1 /1\ 2 1 /1\ 2 1 /l x2 



Hence, by increasing n, this event's probability can be made arbitrarily small. 

Thus we have the result: The event that u is generated by and has a mean 
value m n (to) < X or that to is generated by W 2 and has a mean value m n (to) > X 
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has a probabilty greater or equal to 1 — - • Q) 2 • F° r n $ 2 > l/s 1 this implies that the 
event m n (cj) < X is caused by and m n (cj) > X is caused by W£ has probability 
greater than 1 — e and so is virtually certain. This fact justifies the identification of 
a sample's generating distribution by means of the following criterion: if the sample's 
mean value obeys m n (ui) < X , then the sample uj is assumed to have been generated 
by W™. If, however, m n (cj) > X , then the sample is assumed to have been gererated 
by W£. 

Besides the sample's size n the positiv real number 

S = (25) 

is decisive for the correct identification of the sample-generating distribution from 
the sample's value m n (u) with high probability. The two distributions and are 
identified correctly with the higher a probability the larger the value of S. One might 
call the parameter 5 of two probability measures W\ and Wi on the real line their 
discernability. 

As is well known, any pair (p, A) of a quantum state p : "H — > H and a bounded 
observable A : T-i — > H defines a probability measure W A on R, which has its 
support on the spectrum of A. For any measurable set Z C R the number W A (Z) 
equals the probability that when A is measured on p the measured value belongs to 
Z. The expectation value and variance of ids. under W A equal tr (pA) = (A) and 
tr (pA 2 ) — tr {pA) 2 = (A p A) 2 . Thus the rule of identifying a probability measure 
W € {WijWjj} from an n-sample of measured values (ui,...u n ) can be taken over 
in a straight-forward manner to the quantum case by replacing W{ with the probability 
measure W A . Identification of Wi then amounts to an identification of p, t and the 
discernibility 5 specializes to the expression given by equation ([T]). 

4. Fleming's quantum master inequality 

Let H denote a separable Hilbert space and let the linear mapping A : H — > % be 
bounded and self-adjoint. The expectation value of A in the pure state represented 
by a unit vector v G "H is denoted as (A) v The following quantum master inequality 
(QMIE) relates two pure states through their first two moments of an observable. It 
has been given by Fleming in [1]. 

Proposition 1 For any two unit vectors v , w £ 7i and any linear, bounded and self- 
adjoint operator A : 7-L — > 7-L there holds 

\(A) W - (A) v \ ■ \(w,v)\ < (A V A + A W A) • ^l-\(w,v)\ 2 . (26) 



+ Thus (A) v = (v,Av), and A V A = y (A 2 ) v — (A) 2 v denotes the uncertainty of A in the state 
represented by v. 
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Before proving this estimate we discuss a few of its consequences. Observe first 
that there exists a unique 9 e [0,7r/2] such that \(w,v)\ = cos 9. Squaring the inequality 
( 126]) and slightly rearranging terms yields 

[{{A) w - (A) v ) 2 + (A V A + A W A) 2 ] cos 2 9 < (A V A + A W A) 2 . (27) 

Whenever the term in the square brackets is non-zero, then (126]) is equivalent to 

cos* < (A-A + ^Af 

" \(A) w -{A)f + (A„A + A w A) 2 
For A V A + A W A > inequality ( 128]) is equivalent to 

cos 2 9 < — ^— with 5 = > °- ( 29 ) 

_ 1 + S 2 A V A + A W A ~ v 7 

The bound (129]) for cos 2 is strictly monotonically decreasing from 1 to when 
5 moves from to oo. The number 8 quantifies the distinguishability of the states 
represented by v and w through measuring A. For non-orthogonal vectors v and w the 
inequality (129]) is equivalent to 

5 2 < -\- - 1 = tan 2 9. (30) 
cos z 9 

Thus, for cos 9 > and A V A + A„,v4 > the estimate ( 126]) is equivalent to 

5<tan#. (31) 

If v and u> are parallel, then both sides of the inequality (126]) take the value due 
to (A) w = (A) v and \(w,v)\ = 1. The inequality (126]) is thus saturated in this case for 
any A. If v and w are orthogonal, the estimate (1261) reduces to 

0<A„A + A w A (32) 

This estimate is saturated if and only if A V A = = A W A, which in turn holds if and 
only if both v and w are eigenvectors of A. 

We shall now give a proof of Fleming's quantum master inequality ( 126]) . 
Proof. In a first step we decompose the vector Av into a vector parallel to v and one 
orthogonal to v. This unique decomposition reads 

Av — (A) v v + (Av — (A) v v) , (33) 

since (v, Av — (A) v v) = 0. Observe that the component v a = Av — (A) v v of Av 
orthogonal to v has the norm A V A since 

\v A \ 2 = (Av - (A) v v,Av - (A) v v) = (A v Af . (34) 

We thus have 

(w,Av) = (A) v (w,v) + (w,v A ). (35) 
The analogous decomposition of Aw = (A) w w + wa with wa = Aw — (A) w w yields 

(Aw,v) = (A) w (w,v) + (w A ,v). (36) 
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Since (w,Av) = (Aw,v) we obtain 

((A) w - (A) v ) (w, v) = (w, v A ) - (w A , v) . (37) 

Taking the absolute value from both sides and applying the triangle inequality on C 
gives the estimate 

| (A) w - (A) v \ ■ cosfl = \(w, v A ) - (w A , v)\ < | (w, v A )\ + | (w A , v) | , (38) 

where 9 G [0,7r/2] is uniquely defined through cos^ = \ {w,v) \ . 

Since v A is orthogonal to v, by means of the decomposition of w into a vector from 
C • v and one from its orthogonal complement (C • v) ± according to 

w — (v,w) v + (w — (v, w) v) , (39) 

we obtain the equality 

(w,v A ) = (w - (v,w)v,v A ) . (40) 

After taking the absolute value from both sides the Cauchy-Schwarz inequality in H. 
gives 

| (w, v A ) I = \{w — (v, w) v, v A ) I < \w — (v, w) v\ \v A \ = sin 6* • A V A. (41) 

Interchanging v and w leaves us with 

| (w A , v)\ = | (w A , v - (w, v)w)\< sin 9 ■ A W A. (42) 

Inserting these bounds into the right-hand side of estimate (J38l then leads to the 
statement of prop. [□, 

\(A) w - (A} \ -cos6< (A V A + A W A) ■ sinO. (43) 



5. Conditions for saturating the QMIE 

When H = hh denotes a Hamiltonian, the estimate ( 128]) with w = vt '■= e~ lht v produces 
an upper bound for the survival probability P v (t) = \ (v,v t )\ 2 of the initial state v (v, ■) 
under the time evolution up to time t, which is equivalent to 

(A v A + A v ,Af+\{A) n -{A) v \ 

Searching for an observable A which minimizes J Q T Q (t) dt for a period of time T of a 
spin-l/2-system, we have found the following necessary and sufficient conditions on A 
to saturate the QMIE. [S] 

Proposition 2 Let v,w be unit vectors in a Hilbert space T-L with |(it?,u)| = cos# for 
some 9 e (0, ir/2] , i.e. v and w are assumed to be linearly independent. Let A : T-L — ^ % 
be a linear, bounded and self-adjoint. Then, the QMIE for the states pi = v (v, ■) and 
P2 = w (w, •) is saturated, i.e, the equation 

\(A) W - (A) v \cos9 = (A V A + A W A) sin 9 (45) 

holds, if and only if the conditions (i) and (ii) are fulfilled. 
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(i) The operator A leaves the subspace C • v + C • w invariant. 

(ii) The equation 

(w,Av) = X (w,v) (46) 
holds for some AgI with 

mm{(A) w , (A)J < A < max{(A) w , (A) J . (47) 

For 8 = the QMIE is saturated for any observable A since both sides of the QMIE 
are zero. Thus, the proposition has to deal with the non-trivial case < 9 < n/2 only. 
Proof. The proof of proposition [1] contains three estimates. The first one is (|38|) . It 
uses the triangle inequality for complex numbers as follows 

| (w, v A ) - (w A , v)\ < I (w, v A )\ + \ (w A , v) I . (48) 

Here, equality holds if and only if the complex numbers (w, v A ) and — (w A , v) as elements 
of M 2 ~ C point into the same direction. This is the case if and only if there exists a 
pair (a, (3) G (M> x R> ) \ (0, 0) such that 

a (w, v A ) + (w A , v) = 0. (49) 

The other two estimates are contained in (T4T1) and ( 14*21 . They employ the Cauchy- 
Schwarz inequality for the scalar product of two elements of H. The estimate (T4T1) thus 
is saturated if and only if the vector v A is a (complex) multiple of the (non-zerojfl vector 
w — (v, w) v, i.e., if 

v A G C • (w — (v, w) v) . (50) 

Similarly, the estimate (1421) is saturated if and only if 

w A G C • (v — (w, v) w) . (51) 

Therefore, the equality ( )45l) holds if and only if all three conditions ( ]49l . ( |50l) . and 
( 1511 are fullfilled. The conditions ( 1501) . and ( 15"TI) hold, if and only if A maps the space 
which is spanned by v and w onto itself. This can be seen as follows: (|50|) implies^ that 
Av G C ■ v + C ■ w and (15"TT) implies that Aw G C - v + C - w. On the other hand, since 
v A is by definition orthogonal to v and w A is orthogonal to w, the conditions (|50|) and 
([EE]) follow from A(C-v + C-w) C (C-v + C-w). 

We now have to address condition (1491) . This condition says that there exists a pair 
(a, /3) G (M> x R> ) \ (0, 0) such that 

= a (w, Av - (A) v v)+/3 (Aw - (A) w w, v) (52a) 
= a (w, Av) + /3 (Aw, v) - a (A) v (w, v) - (3 (A) w (w, v) (526) 
= {a + (3) (w, Av) - (a (A) v + (3 (A) J (w, v) . (52c) 

* Due to 9 > we have w — (v,w)v^0^v — (w, v) w. 

t) Observe that it is here that we need that v and w are linearly independent, i.e. that 8 > 0. In case 



of 9 = a condition on va does not follow from saturating estimate (|4T 
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Since a + (3 > 0, it follows that condition (149 j) holds if and only if there exists a pair 
(a, /3) G (M> x M> ) \ (0, 0) such that 

^M=(^(A) V + ^{A) W ) M . (53) 
Due to ^) G ([0, 1] x [0, 1]) s (0, 0) , and ^ + ^ = 1, the real number 

* = (^— Q (A) v + -A- Q (A) w ) (54) 



a + /3 x ,v a + /3 

is a convex combination of (A) v and (A)^ • Thus condition (1491) holds if and only if 
(w, Av) is a real multiple of (w, v) , where the factor belongs to the interval bounded by 
(A) v and (A) w . Thus we have proven equation ( )46i) . ■ 

Observe that in case of cos 6 = the pair (v,w) is an orthonormal basis of the 
space C • v + C • w. Then, the equation ( 1451) holds if and only if A stabilizes the subspace 
C - v + C • w and (w, Av) = 0. This in turn is equivalent to the statement that v and w 
both are eigenvectors of A, because of Av = (v, Av) v+ (w, Av) w = (A) v v and similarly 
Aw = (A) w w. If on the other hand for < 9 < n/2 we have A V A + A W A = it follows 
that (A) w = (A) v and that v and w are eigenvectors of A with the same eigenvalue. 

Thus the nontrivial case of equation (l4"5l) is realized ifO < 9 < n/2 and 
A V A + A W A > is valid. In this case the equality 5a = tan 6* holds if and only if 
the conditions (i) and (ii) are fulfilled. 



6. Observables of maximal 5a 

We shall now determine the set of observables A which for given states p\ = v (v, ■) , 
and p2 = w (w, ■) obey A W A + A V A > and 5a = tan 9. Let v , w G H be unit vectors 
with < \(v, w)\ < 1 and let A : H — > H be linear, bounded and self-adjoint. Without 
loss of generality we assume that (A) v < (A) w and that \(v,w)\ = (v,w) . According to 
prop. [2] the equation 5a = tan 9 holds if and only if 

(i) A stabilizes C • v + C • w 

(ii) The quotient is real and obeys (A) v < < (A) w . 

Since A is self-adjoint, condition (i) implies that A stabilizes the orthogonal 
complement of C • v + C • w too. Therefore, the action of A on this complementary 
subspace [C • v + C • w} ± has no relevance to our problem and it is the restriction Aq of 
A to "H '■= C ■ v + C ■ w only which has to be studied. 

Since SxA+^du = 5a holds for all A G K \ and \i G M and for all A with 
A W A + A V A > 0, we may use this freedom of shifting and rescaling A in such a way 
that the spectrum of A obeys a (A ) = {1, —1} . This is clearly equivalent to 

tr (Aq) = and det (A ) = -1. (55) 

Observe that, because of < A W A + A V A = A W A + A V A , a transformation into 
A = idu is impossible. 
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Among the observables A : "H — ► Ho wich obey (I55p we now search for those 
which satisfy 

To do so we introduce the following orthonormal basis in "Ho : 

V + w w — V , . 

The vectors v and w thus have the decomposition 

w = cos ( ^ J • ei + sin f ^ J • e 2 and t> = cos ( ^ J • e 1 - sin J ■ e 2 .(58) 



The matrix elements of Ao with respect to e = (ei,e2) are denoted as A, 



v 



(ei,A ej) . Clearly, An G R and Ai 2 G C with A 2 \ = A i2 . Condition f|55|) is equivalent 
to 

An = ~A 22 and A?! + |A 12 | 2 = 1. (59) 
For the matrix elements involved in (156]) we find 

= cos (9) A n - sin (9) 3? (Ai 2 ) , (60a) 
(w,Au) = A u -isin(0)3(Ai 2 ), (605) 
(w, Aw) = cos (0) A xl + sin (9) K (A i2 ) . (60c) 

Condition (156|) therefore implies that 

Qf(Ai 2 ) = and U(A l2 ) > 0. 

Due to A\ x + Af 2 = 1 there exists a unique a G [0, n] with 
An = cos a and A 12 = sin a. 
Using this parametrization the matrix elements of Aq obey 

(v,Av) = cos (9 + a) , (61a) 
(u>,At>) =cosa, (616) 
(w,Aw) = cos (9 — a) (61c) 

Condition ( 1561) thus implies 

cos (6 1 ) cos (6* + a) < cos a < cos (#) cos (9 — a) . (62) 

which, due to cos (9) cos (0 + a) = [cos (a) + cos (29 + a)} /2, is equivalent to 

cos (29 + a) < cos a < cos (29 -a). (63) 

On the domain (9, a) G (0, 7r/2) x [0, 7r] condition ( 1631) is equivalent to 

< a < 7T - 9. (64) 

Finally, it is now easy to prove that for any linear, bounded and self-adjoint map 
A : H — > H which stabilizes "Ho and whose restriction A to "Ho obeys 

Aq = cos (a) [ei (ei, ■) - e 2 (e 2 , ■)] + sin (a) [e x (e 2 , ■) + e 2 (ei, ■)] (65) 
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for some a G [8,71 — 6} there holds 5a = tan 6. To do so we first derive from (16 1 a\) and 

fEEII 

(w, Aw) - (v, Av) = 2 sin (9) sin (a) (66) 

and then observe that 

(A V A) 2 = l-(A) 2 v = sin 2 (9 + a), (67) 
(A W A) 2 = 1 - (A)l = sin 2 (9 -a). (68) 

From this it follows that 

_ (w, Aw) - (v, Av) 2 sin (0) sin (a) 

A ~ A W A + A V A " ^/ S i n 2 (6-a) + ^sin 2 (9 + a) 
Since < a — 9 < ix and 0<# + a<7rwe have 



tan0, (72) 



'sin 2 (6» - a) = sin (a - 9) and ^/ sin 2 (9 + a) = sin (a + 9) (70) 
and therefore 

2 sin (6 1 ) sin (a) /nS ... 

S A = = tan (9) . (71) 

2 sin (a) cos (0) v ; v ; 

We may now summarize our result as follows. 

Proposition 3 Let v, w be unit vectors in a Hilbert space ~H with (v, w) = cos 9 for some 
9 G (0, 7r/2) . A linear, bounded self-adjoint operator A : H — > % with A W A + A V A > 
reaches Fleming's bound, i.e. obeys 

s . \(A) W -(A) V \ 
A ' A W A + A V A 
if and only if 

(i) A stabilizes the subspace T-Lq = C- v + C-wC'H and 

(ii) the restriction of A to ~Ho is related to an operator Aq from the set 

{cos (a) (E u - E 22 ) + sin (a) (E 12 + E 21 ) \a G [9, n - 9] } (73) 

through A\ n = XA + ^idu Q for some A G R \ and fi G R. Here Eij := (e 3 -, •) , with 
the vectors from equation (51\ ). 

Observe from equation ( |66l) that for a = tt /2 the expectation values' difference 
(A) — (A) v is maximal. The maximal difference has the value 2sin#. From equation 
( |66l) we then obtain for pi = p 2 = 1/2 that 

sinfl = 2W A (D) - 1. (74) 

Thus we have W A (D) — ~ (1 + sin^) , which coincides with the result of Jaeger and 
Shimony [2 J stated in equation (j5]). The corresponding observable Aq has the following 
particularly simple form 

a 1 \ 1 \ w (w, ■) — v (v, ■) 

A = e 1 (e 2 ,-) + e 2 (e 1 ,-) = . Q ■ 7 5 

sm9 
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